HEAVY TAILED BRANCHING PROCESS WITH IMMIGRATION 
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Abstract. In this paper we analyze a branching process with immigration defined recursively by 
At = 8t o Xt—i + Bt for a sequence (Bt) of i.i.d. random variables and random mappings 8t o 
x := 8t(x) = XliLl > w ith (Aj )igN being a sequence of No-valued i.i.d. random variables 
independent of St. We assume that one of generic variables A and B has a regularly varying tail 
distribution. We identify the tail behaviour of the distribution of the stationary solution Xt- We 
also prove CLT for the partial sums that could be further generalized to FCLT. Finally, we also show 
that partial maxima have a Frechet limiting distribution. 
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1. Introduction 



Branching processes with immigration are one of the corner stone models in applied probability and 
Markov chain theory on a countable state spaces. They were studied extensively in the literature, the 
classical references are: [44, 36, 27, 22, 6, 7, 8, 23, 31, 38]. The process considered in this paper appears 
also in various models in queueing theory (see [2, 3, 4, 5]), polling systems (with possible non-zero 
switch-over times; see [24, 25, 39]), infinite server queues (e.g. [20]) and processor sharing queues (e.g. 
[47]). This process can be also used as a model of packet forwarding in delay-tolerant mobile ad-hoc 
networks (see [20] for details). In the time series theory, the model considered here contains a class 
of so-called INteger AutoRegressive (INAR) processes, see [1]. Some possible applications of these 
processes, starting with medicine and biological sciences are discussed in [13, 17). The INAR processes 
were originally introduced as a discrete counterpart of AR processes. However, their nonlinear struc- 
ture represents a challenge for theoretical analysis. 

We study a process (X t )teti indexed over No = {0,1,2,...}. Here, Xq is a random variable 
independent of an i.i.d. sequence of random mappings (6t)tei,, &t '■ No ~> No defined as follows: for 
each i€Z and for a sequence (A^ )jeN °f No-valued i.i.d. random variables, we have: 



That is, 9 t maps an integer x into a random integer with an interpretation that each of x individuals 
in the (t — l)th generation leaves behind a random number of children, and all these numbers are 
independent and have the same distribution as some generic random variable, say A. To introduce 
immigration in the model, we assume that another i.i.d. sequence (B,B t ,t G 1) of No-valued random 
variables is given independently of the sequence (#t)tez- Then (X t )teTi satisfies: 



X 



:= e t ( x ) = J24 



x G N . 



(1) 



x t = e t ox t _ 1 + B t 



for each t > 1 , 



or in an alternative notation 




for each t > 1 . 



Observe that the random mappings 9 t by definition satisfy 



t o (x + y) = 6 { t 1} o x + 6 ( t 2) o y , 
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where 9 t l , i — 1, 2 on the right-hand side are independent with the same distribution as Q t - 

Denote by / and g the probability generating functions of A and £?, respectively, i.e. 

f(z)=E(z A ), g(z)=E(z B ). 

Following Foster and Williamson [21], the Markov chain (X t ) is ergodic with unique stationary distri- 
bution if and only if 

Jo f{s) - s 

In terms of moments, sufficient conditions are given in [43]. If 

(2) 0<fx := E{A) < 1 and £(ln(l + B)) < oo , 

then the chain is ergodic with unique stationary distribution and even strongly mixing. From now on 
we assume that (2) holds. 

The unique stationary representation of (X t ) t< £n is given by 

oo oo k— 1 oo 

(3) X t £ B t + J2 0t k) ° ■ ■ ■ ° €~kU B ^) °t?\B t _ k ) =: £ C t , fe . 

fe=l fc=l i=0 fc=0 

Since (0^) are i.i.d., it follows that (Ct.k)kefi 18 another sequence of independent random variables. 

In [43] one can find corresponding representation of the stationary distribution in terms o probability 
generating functions. For completeness, we show in Lemma 2.1 that the random series above converges 
with probability 1. From there, it is straightforward to check that such X t satisfy (1). 

We will also consider a Markov chain (X^) tG ^ which evolves as 

X' t = max{6> t o X' t _ l: B t } for each t > 1 . 
The unique stationary distribution of {X' t ) exists since X' t < X t and it is given by: 

(4) X' t = sup{B tl e { t k) o...a ef-£l x {B t _ k ) : k = 1, 2, . . .} = sup{C t , fe : k > 0} . 

Since X[ are dominated by X t in (3), the supremum above is a.s. finite. Direct calculation shows that 
{X' t ) define strictly stationary sequence. 
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In this paper wc identify the tail behaviour of the distribution of the stationary solution X t under 
assumption that the generic size of immigration B or generic size of offsprings A has regularly varying 
distribution. Although there is an extensive literature on behaviour of heavy tailed compound sums 
(see e.g. [16] and reference therein) or heavy tailed random difference equations (see [2H] and references 
therein) those results do not seem to produce in a straightforward manner the asymptotics of stationary 
distribution for branching processes. Surprisingly, literature on tail asymptotics for branching processes 
is limited; see e.g. Corollary 2 in [16]. 

We also prove CLT for the heavy tailed partial sums that could be further generalized to FCLT (see 
e.g. [46]). Furthermore, we show that partial maxima have Frechet limiting distribution. 

The paper is organized as follows. In Section 2 we find the tail behaviour of stationary sequences 
(Xt)t£fi an d (X()tgN under two different regimes. We use this result in establishing CLT for the 
process (X t ) in Section 3. 

2. Tail behaviour 

2.1. Regularly varying immigration (Model I). We will assume that A and B satisfy the following 
conditions: 



for some a G (0, 2) and a slowly varying function L. We consider here the case a G (0, 2) only. 
For a > 2 proofs become much more involved, however, a technique is clearly suggested by the case 
a G [1,2), see Remark 2 below. For a G [1, 2), we also assume that 



(■3) 



< fi = E(A) < 1 , 



(6) 



P(B >x) = x- a L{x) , 



(7) 



E(A 2 ) < co. 



In particular, it means that the tail of A is lighter than that of B in the sense that 



(8) 



P(A >x)= o(P(B > x)) 
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The conditions above are needed, in particular, to conclude that 
(9) 



where Ai are i.i.d. copies of A, independent of B. To establish (9), in [19] the authors assume (5)-(6) 
and (8) together with E(B a ) < oo if a = 1; see Prop. 4.3 in [19]. On the other hand, in [42, Theorem 
3.1] the authors assume that B is consistently varying (which is implied by (6)), E(A r ) < oo for some 
r > 1. Furthermore, they assume that either 

(10) E(B) < oo together with P(A > x) = o(P(B > x)) 
or 

(11) E(B) = oo together with xP(A > x) = o(P(B > x)). 

We note that the latter follows from (7) and the fact that mean of B is infinite. 
In the sequel it is useful to introduce the following random variables 

(12) i (fe) = 6 k o ■ ■ ■ o 6i o 1 

and assume that for each k an i.i.d. sequence (A^ )igN is given with the same distribution as . 
Note also that A^ = A. In general we have 

A 

A {k) £ s PA {k ~ 1) 

1 ' 4 3 

3=1 

with all the random variables on the right hand side being independent. From the above identity and 
E(A) = fx we have E(A\ ') = fi k . Moreover, for Ct,k defined in (3) it holds 



(13) c t . M = A 



Bt-k 

(fc) 

3 

3=1 



We start our analysis by showing that under our basic assumption (2) the random series in (3) 
converges a.s. and hence X t in the same expression are properly defined. 



Lemma 2.1. Assume that (2) holds. Then J^kLo ^t,k converges a.s. 
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Proof. Take an arbitrary e > such that /j, + e < 1. Observe that 

(Bt-k 
E 4 fc) ^ 



< P(B> (M + e) _fe ) +^ ( E ^f )>£ 

< P(ln(l + B) > -fcln^ + e)) 



where the last expression follow by the Markov inequality and E{A^') = fx k . Since .Eln(l + B) < oo, 
the sum of probabilities above for k = 1 to oo is finite. Hence Borel-Cantclli lemma shows that with 
probability 1 only finitely many Ct^'s in (3) are different from 0. □ 

Condition (6) ensures that the upper tail of the immigration distribution is regularly varying. We 
first show that this property is inherited by the random variables X t and X' t introduced in (3) and (4). 

Theorem 2.2. Under the conditions (2), (5)-(7) and (10) or (11), we have 
(14) ^ P ^ >X }=^ P ^ >X l=t^- 

V ' x^oo P(B > X) x-i-oo P(B > X) 

y ' K ' fc=0 

Remark 1. Our model is X t = J2f=i 1 + Bt- If we assume that random variables Af' are equal to 
/i G (0, 1), then we obtain X t = iiX t -i +B t , that is the AR(1) model. For such models it is well-known 
that 

oo 

P(X t > x) ~ e ^ kap ( B > x )- 

k=l 

Hence, the tail of the branching model with random offspring A *' is the same as that of the averaged 
AR(1) model. 



2.2. Proof of Theorem 2.2. For r, k = 1, 2, . . . denote 

m r (k) := E(A[Vy = E (E 1 ^) , 

with a convention A\ = Aj, where Ai arc i.i.d. with the same distribution as A. The following lemma 
turns out to be very useful in the sequel 
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Lemma 2.3. For k > 2 we have: 



(15) E[A[ k) ) 2 <E(A 2 )(k + l) f i k . 



Moreover, if E{B 2 ) < oo, then 



2 



E I J2 4 'J <£(5)£(A 2 )(fc + lV fe + £(,B 2 )/j 
Proof. The proof of (15) is by induction on fc. For fc = 2 we bound 

where all random variables A, Aj are i.i.d. Simple conditioning argument yields the bound 

E(A)E(A 2 ) + {E(A)) 2 E(A 2 ) = E{A 2 )n(l + fi) < 2E(A 2 ) f i 

that matches (15) for k = 2. 
For k > 3 we have 

m 2 {k) =Y, E \ it, A ?~ 1] I ^ = ") = 



= E^ 4 = -)EM i r 

n j = l 



< £ , (A)£ , (A 2 )fc^ fe_1 + £(A 2 ) M 2fe < £(A 2 )(fc + 1)/ 



Furthermore, 



, 2 



s ( E A ? ] I ^ E = -)^(4 fc) ) 2 + E n2p ( B = ")^(4 fc) ) 2 

^' = 1 J n=\ n=\ 

< E(B)E(A 2 )(k + l)fi k + E{B 2 )ii 2k . 



This completes the proof of Lemma 2.3. 
Proof of Theorem 2. 2. 
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Step 1: Large deviation results, such as those developed in [35], [19] (see also [42]) yield that under 
the conditions (5)-(7) and either (10) or (11) we have, 

P(p t (Bt-i)>x)~P(B>x/iJL). 

By induction, 

(16) P(Ct,h >x)=P (jg) 6^ k) (B t -k) > xj ~ P{B > x/n k ) . 

Thus, each of Cj k > 0, is regularly varying and since they are independent, the random variables 

n l 

k=a 

and 

x t,m = max{C tife : k = 0, . . . , m} 
both have the same property and satisfy 

(17) hm 3ftm>f) = Um P(K m >x) = « 
V ' x->oo P(B > x) *->oo P{B>x) 

see [41] for instance. 

Step 2: To bound liminf in (14) from below note that: 



>oo P{B > X) - x^oo P(B > x) 

Therefore, 

Um PiEZo Ct,k > X) > y, 

^->oo P(B > x) ' 

v ' k=0 

by applying (17) and letting m — > oo. 

Step 3: To establish an upper bound for the tail of X t in (14) it is enough to show that 

na\ v r P^Zkp °t,k > x) 
18) hm hmsup — — ^ : = 0. 

k ->oo P(B > X) 

Similarly, to obtain an upper bound for the tail of X' t , it is enough to show that 

, 10 , ,. ,. -P(sup fc>fco C t ,k > x) 
(19) hm hmsup — f- r = 0. 

fco->oo x^too P(B > X) 

However, once we show (18), (19) follows immediately. 
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(20) 



Observe that: 

P(EZ ko c t , k > 



< 



P(B > x) 

^(sup fc > fco B t ^k > X (l - £)//') P(J2V=k Ct,k\ Bt _ k <x{l-e)/^} > X) 



P(B > x) P(B > x) 

The first term on the right hand side is bounded above by: 

P(B > x{l - e)/n k ) 



E 



P(B > x) 

k=k y ' 

One can use the Potter's bounds (see [41]) now to see that its limit is zero if we let first x and then fco 
converge to oo. The second term is more difficult to handle, hence we split the proof in different cases 
with respect to the value of a. 

Case < a < 1: For the second term in (20) we apply the Markov inequality to bound it by 

V 2 - fc E ( B t-kl{B t _ k <x{l~e)/n k }) 

h xP{ - B>x ^ 



By Karamata's theorem in combination with the Potter's bounds each summand above is bounded by 

(1 + e)/ 



1 fl — s 



1-a \ /i fe 

for x, k large enough. Hence, we observe that (18) holds once we take limfe _ i . oo limsup^,^^ in (20). 

Note that a nonnegative random variable Y is regularly varying with index a if and only if Y J , j > 
is regularly varying with index a/j. This remark turns out to be very useful in the rest of the proof. 

Case 1 < a < 2: Note that (18) is equivalent to a requirement: 

r r P(ZZ k0 Ct, k >VZ) .. .. ^((E^ fco ^) 2 >*) n 

lim iimsup r =r = lim limsup — === - = 0. 

fc ->oc 2,^00 P(B > y/x) fc -Kx> x -*oo P(B Z > x) 

Repeating a similar argument as for a G (0, 1), we obtain 
P{B 2 > x) 

P( S up k > ka Bl k >x(l~e)/^) P ((ZZ k0 C t , k I {BLk<x{1 - e]/ ^ } ) >x 
P{B 2 >x) P{B 2 > x) 



in 
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=: Ji(k ) + J 2 (k ). 



The first term could be treated using the Potter's bound since B 2 is regularly varying. Using Markov 
inequality, the second one can be bounded above by: 

E (Efclfco Ct,k%{Bl_ k <x(l-e)/u,2>°}) E (j2T=k C tk I {B 2 _ k <x(l-e)/^ k }) 

xP(B 2 > x) - xP(B 2 > x) + 

E \ Y^f,l=ha Ct,kCt,ll{B2_ k <x(l-s)/n 2k }^{B2_ l <x(l-s)/n 2k }) 

+ xP(B 2 > x) 

Since by Lemma 2.3: 

m 2 (fc) = E (i (fc) ) 2 < C(k + l)fi k , 

it follows that 

/ \ 2 

E E^M = XX^f) +Y / E(A^)E( K A\ k ^<Cn(k + l)^+n 2 n 2 



Therefore, 



Ji(fco) 



E (j2T=k a C t,k l {B 2 t _ k <x(l-e)/^ k } 



xP(B 2 > x) 

* E 



E«<(x(i- e ))i/2/ M fc E fe"=i ^ fe) ) P(Bt-k - n) 



k — kn 



P(B 2 > x) 



- v "I" )V x p[B 2 >x) ^ 1 xP(B 2 > x) 

k=k v ' k=k y ' 

= Jll(fco) + Jl2{ko) ■ 

Since B 2 is regularly varying with index a/2 € (0, 1), Karamata's theorem applies again and we finally 
have 

lim limsup Ji2(fc ) = 0. 

/c — yoc ^—^oo 

If E{B) < oo, then Jn(fc ) is bounded by 



k—kn 



and hence goes to as x -> oo. Otherwise, if E(B) — oo (hence a = 1), then we write 

(2D WM^EIH.) /^''' 1 ' 
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Now, we note that 

rv 



E(Bl {B < y} ) < [ P(B> u)du =: L{y). 
Jo 



Theorem 1.5.9a in [12] yields that L(y) is slowly varying. Furthermore, the Potter's theorem yields 
that for any chosen constants A > and 5 > 0, 



< A max ' 



(!)' 



-s 



L(z) 

as long as y and z are sufhciently large. Hence, for a sufficiently large x, 

E (M {B<V5VI= 5 /Mfc} ) < L(V^VT^/ti k ) < CL(^)max{^, M - M }. 

Thus, 

k — ho 

The series is summable if we choose S S (0, 1). Now, lim^-j.oo ^.p^^S,^ = 0, since L is slowly varying 
and P(B > y/x) is regularly varying with index —1/2. 

Likewise, 



< 



xP(B 2 > x) 

Y^'= k o E (( fc + l )^ k Bt-k^{B 2 t _ k <x(l-e)/ti 2k }) E {O- + 5 t-iI{S t 2 _ ! <a;(l-e)/ A1 2fc }) 



xP(B 2 > x) 

Again, if E(B) < oo, then the term is bounded by 
and hence 

lim limsup J2(fc ) = 0. 

/Co ►OO x — yoQ 

Otherwise, if E(B) = oo, we apply the same argument as for Jn(fco). 



Remark 2. Let us briefly discuss the case of 2 < a < 3 (the same applies to all a > 2). To prove 
Theorem 2.2 for this range of a it suffices to show that: 

,. ,. P(EZ k0 c t , k > x^) P((EZ k0 Ct, k f > x) 

hm limsup — —:- = lim limsup „ . " r = U. 

fco^oo x ^o^ P(B > X 1 / 3 ) fco^oo ^-coo P(B 3 > x) 
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Note that 



P(B 3 > x) 

< P(su Pfe > feo B^ k > X{1 - e)/n Sk ) P {{^k Ct.kJ{Bl_ k <x(l-e)/^}) > 



3 

X 



P(B 3 > x) P{B 3 > x) 

The first term in this bound is handled as before. For the second term apply Markov inequality and 
note that: 

E E C t,k l {B3_ k <x{l-s)/^} ] 
\k=k a J 



;)/V 3fc } 



- [J2 E ( C lk l {B-i_ k <x(l-e) 
k—ko 

oo 

+ 3 E E ( C fi I {B, 3 _ t <i(l^)/c 3 '} C ' t J I {B t 3 _ i <i(l^)/l' 3i }) 

k,j = k 
k^j 

OO 

^ E ( y C t,k^B^_ k <x(l-e)/fi 3k } C t,j\Bf_ J <x(l-e)/ l i^}Ct^B^_ l <x(l-e)/^ 1 }^ 



k,j,l = kQ 
k^j^l 



=: Li(fc ) + L 2 (k a ) + L 3 (k ) . 
Now, L 3 {ko) can be bounded above by: 

oo / oo N 

{E{B)f ( k + + W + ^ ( E ( E )) 3 E ^ 



k,j,l = kQ \k — kn 

k^j^l 



Hence lim^-^ lim sup^,^^ L 3 (k^)(xP(B ? ' > x)) 1 = 0. To deal with ii(fco) and L2(&o) we proceed 
in the same way like in the step 3 of the proof of Theorem 2.2. We only need a bound similar to that 
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of Lemma 2.3. Using similar arguments like in the proof of Lemma 2.3 observe that: 

m 3 (fc) < fiEiA^-V) 3 + 3fiE(A 2 )E{A ( - k - 1) ) 2 + E(A 3 )fi 3k 

= (i {/^(i^" 2 )) 3 + Z^EiA^EiA^f + EiA 3 )^- 1 ''} 

+ 3[iE(A 2 )E(A (k - r >) 2 + E(A 3 )n 3k 
= n 2 [^{A^) 3 + ^E(A 2 )E{M k ^) 2 + E(A 3 )fi 3{k - 2) } 

+ 3[i 2 E(A 2 )E(A( k ~ 2 )) 2 + 3tiE(A 2 )E(A ik -V) 2 + E(A 3 )fi 3< - k -^ + E(A 3 )fi 3k 



k-1 k-1 

= Lt k E(A {0) ) 3 + 3 ^ EiA^EiA^-^f + E(A 3 ) ^ n 3 ^-^ . 

i=i 3=1 

Using Lemma 2.3 produces: 

fe-l 

m 3 (fc) < n k E(A 3 ) + E(A 3 )kpL 2k + 3E(A 2 )E(A 2 )Y,V J (k- j + l)n k ~ j 

3=1 

< fx k E(A s ) + E(A 3 )k f i 2k + 3E(A 2 )E(A 2 )k 2 Li k . 

Similar computation can be done, in principle, for arbitrary m r (k), r > 4. We note in passing that 
classical inequalities, like Rosenthal's inequality, do not seem to be applicable here. 

2.3. Regularly varying offspring (Model II). Throughout this subsection we will assume the 
following conditions: 

(22) < n := E(A) < 1 , 

(23) P(A > x) = x~ a L(x) , 

for some a <G (1,2) and a slowly varying function L. We consider here the case a € (1,2) only. For 
a > 2 the proof of the main result of this subsection could be adopted along the lines of Remark 2. 
We will also assume that the tail of B is not heavier than that of A in the sense that 

(24) lim^4=c, 

V ^ x^oo P(A > X) 

where c is finite (possible equal 0) constant. If c > we need also to assume that B is consistently 
varying. In particular, we note that (24) together with (22) implies that E{B) < oo. 
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From [16], using induction, we can conclude that: 



P(A {k) > x) ~ /iP(l (fe - x) > x) + P (a > 



f iP{A ( - k - 1) >x) + ^ k - 1)a P{A > x) 

A1 2p ( ^(fe-2) > x j + w (fc-2)ap( 4 > x) + ^(i-ljap^ > x ) 
fe-1 

^(fc-l)a ^ ^(!-«)p(A > x ) =■ d k P(A > X). 



3=0 

Hence, using again [16], we get: 

/fc-i 



P(C t , k > x) = P |^(g) 9fZi\B t - k ) > xj ~ ^(i?)P(i( fc ) > x) + P(B > x/EAW) 

~ E(B)P(A^ >x)+ c(EA^) a P(A > x) 
~ (E{B)d k + cE{A^) a ^j P(A > x) 
(25) = (E(B)d k + cfi ka ) P{A > x) =: ^, k P(A > x). 

Theorem 2.4. Under the conditions (22)-(24), we have 

V ' x^oo P(A > X) x^oo P(A > x) 

Proof. We will follow the arguments of the proof of Theorem 2.2. In first step one can observe from 
(25) that 

P(X t , m > x) P(X' tim > x) ™ 
urn — r — = hm H r — = > w k . 

x~>oo P(A > X) x^oo P(A > x) £z£ 

Step 2 is the same like i the proof of Theorem 2.2. Step 3 concerns the proof of equality: 

t07\ V V P (S^fc C t,k > X) 

(27) hm hmsup — f = 0. 

fco->oo x -*ao P(A > X) 

Similarly, one will obtain an upper bound for the process (X' t ). Now, 

P(Ek =kQ Ct,k>x) 



< 



P{B > x) 

-P(sup fc > fco B t _ k > x(l - e)/n k ) P(J2kLk c t,k\ Bt _ h <x{i- s )/^} > x ) 



P(A > x) P{A > x) 

The first term on the right hand side is bounded above by: 

^ P{B>x{l-e)/n k ) ^ P(A>x{l-e)/» k ) 

^ P(A>x) ~ [ ' ^ P(A>x) 

k=k v ' k=k v ' 
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Using the Potter's bounds we can conclude that this term goes to as first x and then ko go to oo. 
To prove that the second term on the right hand side goes to as first x and then fco g° to oo we can 
proceed in the same like in the proof of Theorem 2.2 using a fact that lim^^oo xP(A 2 > x) — oo for 
1< a < 2. □ 



3. ASYMPTOTICS FOR SUMS AND MAXIMA 

Throughout this section we assume that (Xt)tez is a stationary process satisfying (1) with distri- 
butions of A and B satisfying assumptions (5)-(7) of Model I (with a ^ 1) or (22)-(24) of Model II. 
In either case, by the results of Section 2, the distribution of each X t is regularly varying with some 
index a > 0. 

Remark 3. The sequence (Xt) can be shown to satisfy the well known drift/majorization criterion 
for geometric ergodicity of Markov chains (cf. [34] or [29]). For < e < min{l, a} for instance, the 
function V : No — > [l,oo) given by V(x) = 1 + x E , satisfies drift condition (5) in [29], where for small 
set C one can take a set of the form {0, 1, . . . , M} with integer M large enough. This further means 
that process (X t ) is strongly mixing with a geometric rate (see Theorem 2 in [29]). 

We argue next that the stationary time series (X t ) is jointly regularly varying, i.e. all its finite 
dimensional distributions are regularly varying. Namely, for a random variable Yo with Pareto distri- 
bution P(Fo > y) = y~ a for y > 1 and a sequence Y„ = Yo^i n , n e No, the following holds. 

Lemma 3.1. Under assumptions of Model I or Model II as x — > oo, 
(28) ((x- 1 X„)„ 6No \X >x)^ (F„)„ eNo . 

Here we use "— ^V' to denote convergence of the finite-dimensional distributions. In the language 
of [11], (Y t ) represents the tail sequence of the sequence (X t ). 

Proof. This follows immediately, since by regular variation of Xq and the law of large numbers, as 
x-^f oo 

C(X /x | X a > x) A C(Y ) 

£(VE£if ) l^o>^) ^ V- 
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Hence, (28) follows by Slutsky argument noting that for t > 0: 

t-l x 

(29) X t = £C t , fe +£If\ 

k=0 i=l 

where Ct^-'s are defined in (3) and -A^'s in (12), with all the random variables on the right hand side 
being independent. □ 

Denote in the sequel by (a n ) a sequence of constants such that for any u > as n—> oo 



(30) 



nP(Xo > a n u) — > u 



It exists and tends to oo by the regular variation property of the random variables X t . We observe next 
that the strong mixing property of the process (Xt) implies condition A'(a n ) of [10]. The condition 
states that for some sequence of integers r„— > oo, r n = o(n), denoting k n = \n/r n \ , as n oo we have 

k n 



(31) 



E 



exp 



T-f \™ fin 

2—1 X 



He cxp^-^r/- 



fe=i 



— ' a, 



0. 



for every / £ C^([0, 1] x R \ {0}), where C^(E) is the space of all nonnegative continuous functions 
on E 1 with compact support. Moreover, by the geometric decay of mixing coefficients (see Remark 3) 
such a sequence (r n ) can be taken to be of the order o(n £ ), for any e £ (0, 1). 

Finally, the structure of regularly varying process (Xt) in either of two regimes (Model I and II) allow 
us to apply Karamata's theorem and show that the process satisfies another well known condition in 
the literature. It is called anticlustering condition by [15] and [9], or finite mean cluster size condition 
in [10]. 



Lemma 3.2. Under assumptions of Model I or Model II with a/1 w have 



(32) 



lim lim sup P max Xt > ua r , 

\m<\t\<r n 



Xq > ua 7 



0. 



Proof. By stationarity 



lim lim sup P max X t > ua n 

m-s-oo n-j.oo \m<\t\<r n 



As we have seen in (29) for t > 



Xq > ua n I < lim lim sup 2 > P[ X t > ua n 



X > UCL n 



fc=0 



1=1 
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Since Cj t > 1, fc > 0, is independent of Xg, we note that: 



\k=0 



Therefore, since r n = o(n), it follows that: 
(33) 



lim limsup p[ X t > ua n X > ua n ) < lim limsup P( > 



t — m 

However, for a > 1, by Markov inequality: 

x 



t—m x i — l 



x 



Xq > ua, 



P(Ef>y 



t E { X ^{X >ua n }) 



P(X a > ua n )ua n 

Using Karamata's theorem, note that the right hand side converges to 2^ (a — l) -1 as n— > oo. There- 
fore, (32) holds for a > 1 and both Model I and II. 

For Model I and a < 1, we first observe that: 
/ x a 



(0 



> x\ <P Xn> 



(l-s)x 



fc<J 



By Markov inequality, the last sum above is bounded by 

£(^oI {Xn <(i^)i } ) 



Clearly 



/ x 

\i=i 



X > ua n < 



P (X > ua n ) 



Hence, the summands on the right hand side of (33) can be bounded from above by 



P X > 



(1 — e)ua r , 
2f? 



E(X I 



P (X > ua n ) 



{ x <a^ } ) 



P{X > ua n )ua n /2 



Now, if a < 1, apply Potter's bound together with Karamata theorem to conclude that after summing 
these terms for t = m, . . . , r„, we may let first n— > oo and then m— > oo to obtain limit 0. If a = 1, 
then we follow the same argument as in the proof of Theorem 2.2. We have 



E(X l {Xo <y}) < / P(X > u)du =: L(y), 
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where L(y) is slowly varying. Hence, by Potter's bounds for each S > there exists C = Cg such that 
E { X l {Xo< _^ } ) t 



P(X Q > ua n )ua n /2 P(Xq > ua n )ua n /2 



^ C ^ P(Y To max{(2^) 5 (l - e)- 5 , (2^)^(1 - e) 5 }- 

r(A > ua n )ua n /2 



□ 



Remark 4. We note that the above argument does not work for a = 1. Indeed, Theorem 1.5.9a in [12] 
implies that 



L(y) zjP(X > y) 

3.1. Point process and maxima. Lemmas 3.1 and 3.2 allow us to describe the asymptotic behavior 
of the following point processes 

n 

(34) N n = ^2 x,/o„) for all n G N. 

t=i 

We recall the basic notions of point processes theory, for a good introduction we refer to [40] or [41]. 
Let £ be a locally compact Hausdorff topological space and let M p (E) be a space of Radon point 
measures on E. The space M p (E) is equipped with vague metric d(-, •). We say that a sequence 
fx n S M P (E) converges vaguely to fx € M p (E) if J B /<i^„ — > J E fdfi for all nonnegative continuous 
functions on E with compact support. Once the state M p (E) is endowed with a vague topology, one 
can study distributional limits of its random elements like N n . 

It turns out by Theorem 2.3 in [10] that there exist a point processes N^ u ' , u > on the space 
[0, 1] x (u, oo ) with compound Poisson structure such that as n—> oo 



(35) N n 



4 

[0,l]x(«,oo) 



By the same theorem in [10] the extremal index of the stationary sequence (X t ) (see [40] for instance) 
equals 

(36) 6 = P(supK t < 1) = P(Y Q < 1/n) = 1 - n a > 0. 

»>i 

Therefore, partial maxima of the process (Xt) converge to the scaled Frechet distribution. 
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Corollary 3.3. Under assumptions of Model I or Model II with a^l, as n— > oo it holds that 

P (^<x) ^exp(-(l- , 
for every x > 0, where M n = max(Xi, . . . , X n ). 

However more precise statement on the behavior of large values can be made. For instance, the 
following proposition describes the clustering of large values in the sequence (X t ) and it follows from 
(35) by the similar token as corollary above (see e.g. [32]). 

Proposition 3.4. Under assumptions of Model I or Model II with a I there is a compound Poisson 
process N° on [0, 1] such that 

n 

(37) N° = S±kx i>an } ^N°, oo. 

i=l 

The proof of the proposition follows directly from (35) by Theorem 4.2 of [30]. Moreover, the 
limiting process N° has the following representation 



7V° = ]TMt,, 



where St { is a homogeneous Poisson point process on the interval [0, 1] with intensity 9 and (fvj)j>i 
is a sequence of i.i.d. random variables with values in N independent of it. Finally, random variables 
Ki have geometric distribution since by Theorem 2.3 in [10] 

P( Kl =k) = \[P >0:Yj>l}\ = k) -P(\{j >0:Y j >l}\ = k+ 1)] = ^^(1 - M "«) 

for all k € N. 



3.2. Partial sums. In the case a > 2, which we do not consider here in detail (see Remark 2), one can 
show that the classical central limit theorem for strongly mixing sequences due to Ibragimov applies 
(see [29], Theorem 5). Namely, as n— > oo 

4= (S n ~ nEB/{\ - fx)) ^ N(0, a 2 ) , 

where a 1 = E(Xo) 2 + Y^Li E(XoXi) < oo and S n denote partial sums of the process, i.e. S n = 
X\ + • • • + X n , n > 1. In the case < a < 2, X^s have infinite variance and an alternative limit 
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theorem holds. That is, there exists an a-stablc random variable S a such that properly centered and 
normalized partial sums converge to S a . 

Under the assumptions of Model I, for a £ (0, 1), using Theorem 3.1 in [15] together with lemmas 3.1 
and 3.2 and the strong mixing property of the stationary sequence (X t ) (cf. Remark 3), one can deduce 
the following result 

Sn d „ 

>S a . 

a n 

Similarly, when a £ (1, 2) under assumptions of either Model I or II, strictly stationary sequence (X t ) 
satisfies 

Sn ~ E ^\^<i} d c 

^ £>a j 

a n 

however, an additional technical condition is needed, sec Theorem 3.1 in [15] and condition (3.2) 
therein. In cither case the limit S a is an a-stablc random variable with a Levy triple (0, 0, r) identifiable 
as in [9]. Using results in [10] (cf. [33]) it seems that with an additional effort, one can also show a 
functional limit theorem for the partial sums under similar conditions, but we do not pursue that here. 
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